MS&E 336 Lecture 15: Calibration
نویسنده
چکیده
Calibration is a concept that tries to formalize a notion of quality for forecasters. For example, suppose a weatherman predicts each day whether the it will rain, or be sunny. Typically forecasters will predict such events in terms of probabilities, i.e., “There is a 30% chance of rain.” Given only the outcome that day, it is impossible to judge the quality of such a forecast. However, if we consider all days on which a forecaster said the probability of rain was x%, it is reasonable to expect that the fraction of such days on which it rained is exactly x%. This is precisely the notion of calibration. In this lecture we first define a notion of calibration that is appropriate for the study of games, and then prove that calibration is essentially a generalization of internal regret minimization. Formally, we will show that playing a best response to a calibrated forecast of the opponent is an internal regret minimizing strategy; and that using an internal regret minimizing algorithm, one can easily build a calibrated forecaster. Our presentation is based primarily on the corresponding paper of Foster and Vohra [2]. Throughout the lecture we consider a finite two-player game, where each player i has a finite pure action set Ai; let A = ∏
منابع مشابه
MS&E 336 Lecture 1: Dynamic games
• For each player i, a partition Ii of {h ∈ H : P (h) = i} (i.e., each history occurs in exactly one of the sets in Ii), such that if h, h ∈ Ii, then A(h) = A(h). The collection Ii is the information partition of player i, and a member Ii ∈ Ii is an information set of player i. For any information set I , let P (I) and A(I) denote the player and action set corresponding to the information set. ...
متن کاملMS&E 336 Lecture 11: The multiplicative weights algorithm
This lecture is based on the corresponding paper of Freund and Schapire [2], though with some differences in notation and analysis. We introduce and study the multiplicative weights (MW) algorithm, which is an external regret minimizing (i.e., Hannan consistent) algorithm for playing a game. The same algorithm has been analyzed in various forms, particularly in the study of online learning; see...
متن کاملMS&E 336 Lecture 14: Approachability and regret minimization
j 6=i Aj . We let ai denote a pure action for player i, and let si ∈ ∆(Ai) denote a mixed action for player i. We will typically view si as a vector in R Ai , with si(ai) equal to the probability that player i places on ai. We let Πi(a) denote the payoff to player i when the composite pure action vector is a, and by an abuse of notation also let Πi(s) denote the expected payoff to player i when...
متن کاملMs&e 336 Lecture 2: a Sequential Entry Game 2 Incomplete Information 2.1 One Entrant
There is an incumbent firm (denoted I), and N possible entrants. At each time step, an entrant faces the incumbent in a two stage game; assume that entrants are faced in reverse numerical order, and we number the stages correspondingly: thus stage 1 is the last stage, and entrant 1 is faced last. ALl players observe all actions taken by players in the past. At stage k, first the entrant decides...
متن کاملQuantification of urinary albumin by using protein cleavage and LC-MS/MS.
BACKGROUND Urinary albumin excretion is a sensitive diagnostic and prognostic marker for renal disease. Therefore, measurement of urinary albumin must be accurate and precise. We have developed a method to quantify intact urinary albumin with a low limit of quantification (LOQ). METHODS We constructed an external calibration curve using purified human serum albumin (HSA) added to a charcoal-s...
متن کامل